Mining Web Logs to Improve Web Caching and Prefetching

نویسندگان

  • Qiang Yang
  • Henry Haining Zhang
  • Ian Tian Yi Li
  • Ye Lu
چکیده

Caching and prefetching are well known strategies for improving the performance of Internet systems. The heart of a caching system is its page replacement policy, which selects the pages to be replaced in a proxy cache when a request arrives. By the same token, the essence of a prefetching algorithm lies in its ability to accurately predict future request. In this paper, we present a method for caching variable-sized web objects using an n-gram based prediction of future web requests. Our method aims at mining a prediction model from the web logs for document access patterns and using the model to extend the well-known GDSF caching policy. In addition, we present a new method to integrate this caching algorithm with a prediction-based prefetching algorithm. We empirically show that the system performance is greatly improved using the integrated approach.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Mining Web Logs for Actionable Knowledge

Everyday, popular Web sites attract millions of visitors. These visitors leave behind vast amount of Web site traversal information in the form of Web server and query logs. By analyzing these logs, it is possible to discover various kinds of knowledge, which can be applied to improve the performance of Web services. A particularly useful kind of knowledge is knowledge that can be immediately a...

متن کامل

Proxy Side Web Prefetching Scheme for Efficient Bandwidth Usage: Data Mining Approach

users grows, Web traffic continues to increase at an exponential rate and has become one of the major components of Internet traffic. One of the solutions to reduce Web traffic and speed up Web access is Web caching and prefetching. Web prefetching is one of the methods to condense user’s latencies in the World Wide Web professionally. User’s accesses makes it possible to predict future accesse...

متن کامل

Integrating Intelligent Predictive Caching and Static Prefetching in Web Proxy Servers

Web caching and Web prefetching are two important techniques used to reduce the noticeable response time perceived by users. By integrating Web caching and Web prefetching, these two techniques can complement each other since the Web caching technique exploits the temporal locality, whereas Web prefetching technique utilizes the spatial locality of Web objects [32]. In this paper, we develop al...

متن کامل

Long-term Web Prefetching Algorithms: A Comparative Study

User perceived latency has become a potential problem due to the increase in internet traffic. Web caching is an effective means of reducing user perceived latency. Web prefetching is an attractive solution which relies on web caching to reduce access latency. There are two kinds of algorithms that are currently used for prefetching i.e., linear algorithms and data mining algorithms. Web prefet...

متن کامل

Mining Web Logs with PLSA Based Prediction Model to Improve Web Caching Performance

Web caching is a well-known strategy for improving the performance of web systems. The key to better web caching performance is an efficient replacing policy that keeps in the cache popular documents and replaces rarely used ones. When coupled with web log mining, the replacing policy can more accurately decide which documents should be cached. In this paper, we present a PLSA based prediction ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2001